Point Processes Modeling of Time Series 
Exhibiting Power-Law Statistics 



B. Kaulakys*, M. Alaburda* and V. Gontis* 

*Institute of Theoretical Physics and Astronomy of Vilnius University, A. Gostauto 12, LT-01108
Vilnius, Lithuania

INTRODUCTION

Recently, we proposed [1] and generalized [2] stochastic point process models that
generate a variety of monofractal and multifractal time series exhibiting power laws
in the spectrum $S(f)$ and in the distribution $P(I)$ of the signal intensity, and applied
them to the analysis of financial systems [3]. These models can generate $1/f$ noise
with a very large Hooge parameter. They may serve as a theoretical framework
for understanding huge fluctuations (see, e.g., [4]) and for describing jointly a
large variety of observable statistics: the power spectral density (PSD), $S(f) \sim 1/f^{\beta}$,
and the signal probability distribution function (PDF), $P(I) \sim 1/I^{\lambda}$, with different slopes,
different distributions $P(\tau)$ of the interevent time $\tau$, and different multifractality.

Here we present extensions and generalizations of the point process models
to Poissonian-like processes with a slowly diffusing mean interevent time [5]. We
adjust the parameters of the generalized model to empirical data on trading
activity in financial markets [6] and to the frequencies of word occurrences in
language, reproducing both the PDF and the PSD.

THE MODEL 

We treat a stochastic time series as a sequence of events that occur at discrete
times $t_1, t_2, \ldots, t_k, \ldots$ and can be considered identical point events. Such a point
process is equivalently defined by the set of stochastic interevent times $\tau_k = t_{k+1} - t_k$.
Let us consider the flow of events as a Poissonian-like process driven by a multiplicative
stochastic equation. We define the stochastic rate $n = 1/\tau$ of the event flow by the
continuous stochastic differential equation
where $W$ is a standard Wiener process, $\sigma$ denotes the standard deviation of the white
noise, $\gamma \ll 1$ is a coefficient of the nonlinear damping and $\mu$ defines the power of
the noise multiplicativity. The diffusion of $\tau$ is restricted from the side of high values
by an additional term $-\frac{m}{2}\sigma^2\left(\frac{\tau}{\tau_0}\right)^m$, which produces the
exponential diffusion reversion; $m$ and $\tau_0$ are the power and value of the diffusion
reversion, respectively. The associated Fokker-Planck equation with zero flow gives the
simple stationary PDF



$$P(\tau) \sim \tau^{\alpha+1} \exp\left[-\left(\frac{\tau}{\tau_0}\right)^m\right]$$
with $\alpha = 2(\gamma_\sigma - \mu)$ and $\gamma_\sigma = \gamma/\sigma^2$. Eq. (1) describes the continuous
stochastic variable $\tau$ and defines the rate $n = 1/\tau$, whose stationary distribution and
PSD $S(f)$ are given in [5, 6].
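For illustration, the diffusion of $\tau$ can be sketched numerically. The minimal Python sketch below assumes that Eq. (1) has the form $d\tau = [\gamma - \frac{m}{2}\sigma^2(\tau/\tau_0)^m]\,\tau^{2\mu-2}\,dt + \sigma\tau^{\mu-1/2}\,dW$, which is consistent with the reversion term and the parameters described above but should be checked against [5, 6]. The function name `simulate_tau`, the Euler-Maruyama step, the reflection at zero and all numerical values are assumptions made for the example, not the authors' procedure.

```python
import math
import random


def simulate_tau(n_steps, dt, gamma, sigma, mu=1.0, m=2, tau0=1.0,
                 tau_min=1e-9, seed=0):
    """Euler-Maruyama path of tau under the ASSUMED form of Eq. (1):

        d tau = [gamma - (m/2) sigma^2 (tau/tau0)^m] tau^(2 mu - 2) dt
                + sigma tau^(mu - 1/2) dW
    """
    rng = random.Random(seed)          # fixed seed for reproducibility
    tau = tau0                         # hypothetical initial condition
    path = [tau]
    sqrt_dt = math.sqrt(dt)
    for _ in range(n_steps):
        reversion = 0.5 * m * sigma ** 2 * (tau / tau0) ** m
        drift = (gamma - reversion) * tau ** (2.0 * mu - 2.0)
        noise = sigma * tau ** (mu - 0.5) * sqrt_dt * rng.gauss(0.0, 1.0)
        # Reflect at zero and keep tau strictly positive (an ad hoc choice).
        tau = max(abs(tau + drift * dt + noise), tau_min)
        path.append(tau)
    return path


# Example: gamma / sigma^2 = 0.85 and mu = 1 give alpha = 2 (0.85 - 1) = -0.3.
path = simulate_tau(n_steps=5000, dt=0.1, gamma=0.0085, sigma=0.1)
print(min(path), max(path))
```

A small time step keeps the explicit scheme stable; a histogram of a long path can then be compared with the stationary PDF of $\tau$.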
Here we define the fractal point process driven by the stochastic differential equation
(1), i.e., we take $\tau(t)$ as the slowly diffusing mean interevent time of a Poissonian-like
process with the stochastic rate $n$. Under this assumption the conditional probability of
the interevent time $\tau_p$ in the Poissonian-like process with the stochastic rate $1/\tau$ is
exponential,

$$\varphi(\tau_p|\tau) = \frac{1}{\tau}\exp\left(-\frac{\tau_p}{\tau}\right).$$
Then the long-time distribution $P_k(\tau_p)$ of the interevent time $\tau_p$ in $k$-space [2]
has the integral form
with $C$ defined from the normalization $\int_0^\infty P_k(\tau_p)\,d\tau_p = 1$. The distributions
of the interevent time $\tau_p$ have explicit forms for integer values of the power $m$: for
$m = 1$ they are expressed through the modified Bessel function [5, 6], and for $m > 1$ in
terms of hypergeometric functions.



DISCUSSION 

We consider two applications of the proposed model. The frequencies of word
occurrences in language depend on the content and diffuse through the text. We treat
the flow of words in a text as a sequence of discrete steps, i.e., one word is the unit
of the interval. The number of other words between two successive
occurrences of the same noun then measures the interevent interval $\tau_p$ of the point process
defined for the sequence of the selected noun. One can easily calculate the sequence of
all occurrence intervals $\tau_p$ of the selected word and so define a realization of the point
process.

FIGURE 1. (a) The power spectral density $S(f)$ of the point process for occurrences of the word "eye" in the
novels of Jack London. The straight line approximates the power law with the exponent $\beta = 0.7$. (b) The
distribution of the interevent interval $\tau_p$ for occurrences of the word "eye", calculated from the histogram
in the same novels. The smooth line represents the integral formula (6) with $m = 2$, $\tau_0 = 800$ and $\alpha = -0.3$.

Here we demonstrate the statistics of the word "eye" in selected novels
of Jack London, over 1.2 million words in total. First, we show that the point
process defined in this way has long memory, with the PSD exponent $\beta = 0.7$ [Fig.
1 (a)]. Assuming a pure multiplicative process with $\mu = 1$, the relation
$\beta = 2\gamma_\sigma - 1 = 1 + \alpha$ yields the parameter $\alpha = -0.3$, which enters the probability
distribution functions. The histogram of the $\tau_p$ distribution coincides with the theoretical PDF
defined by its integral form (6) when $m = 2$ [Fig. 1 (b)]. The exponent $\lambda$ of the power-law
distribution of the number of occurrences of the word "eye" in 1000-word pieces
of text is $\lambda = 4.4$. As we will see below, this example of word statistics
resembles the statistical properties of trading activity in financial markets.
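The construction of the word intervals described above can be sketched in Python. The tokenizer (lower-casing plus a simple regular expression), the function names and the toy sentence are assumptions for illustration; the interval is taken as the number of other words between successive occurrences, as in the text.

```python
import re


def tokenize(text):
    """Split a text into lower-case word tokens (a deliberately simple rule)."""
    return re.findall(r"[a-z']+", text.lower())


def interevent_intervals(tokens, target):
    """Number of other words between successive occurrences of `target`."""
    positions = [i for i, tok in enumerate(tokens) if tok == target]
    return [later - earlier - 1
            for earlier, later in zip(positions, positions[1:])]


def counts_per_block(tokens, target, block=1000):
    """Occurrences of `target` in consecutive pieces of `block` words."""
    return [tokens[i:i + block].count(target)
            for i in range(0, len(tokens), block)]


tokens = tokenize("The eye sees; an eye for an eye.")
print(interevent_intervals(tokens, "eye"))       # intervals tau_p
print(counts_per_block(tokens, "eye", block=4))  # counts per 4-word piece
```

For a real corpus the histogram of the intervals gives the empirical $\tau_p$ distribution, and the histogram of the block counts gives the distribution whose exponent $\lambda$ is quoted above.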

In the case of the financial market we consider every trade of a selected stock as a
point event, i.e., the sequence of all trades of the stock forms a stochastic point
process described by the set of time intervals between successive trades. The power
spectral density of the trade sequence serves as a measure of the long-range memory
of trading activity. An example spectrum for the sequence of trades of the stock CVX
over a two-year period on the NYSE, Fig. 2 (a), reveals the structure
of the power spectral density over a wide range of frequencies and shows that real
markets exhibit two power laws, with the exponents $\beta_1 = 0.8$ and $\beta_2 = 0.3$. In our recent
works [5, 6] we have proposed an adjustment of the model, introducing a new form of the
modulating stochastic differential equation instead of Eq. (1),
where a new parameter $\varepsilon$ defines the crossover between two areas of $\tau$ diffusion, under
the assumption $\tau_0 = 1$. For other values of $\tau_0$ the solution of Eq. (7) has to be scaled
by $\tau_0$. The Poissonian-like point process modulated by Eq. (7) reproduces the PSD of
the empirical trade sequence, Fig. 2 (a), in detail, including the two exponents and the
crossover point, Fig. 2 (b). The proposed model with the same parameters reproduces the
empirical PDF of $\tau_p$, Fig. 2 (c) and (d), very well.

FIGURE 2. (a) The power spectral density $S(f)$ of the stock CVX trade sequence on the NYSE.
(b) Power spectral density of the Poissonian-like process driven by Eq. (7) with the parameters $\tau_0 = 4$,
$\sigma = 0.023$, $\gamma = 0.00034$, and $\varepsilon = 0.06$. Straight lines approximate the power-law spectrum with exponents
$\beta_1 = 0.8$ and $\beta_2 = 0.3$. (c) The empirical distribution of $\tau_p$ calculated from the histogram of CVX trades
on the NYSE. The smooth line represents the integral formula (6) for $m = 2$, $\tau_0 = 4$ and $\alpha = -0.3$. (d) The
model distribution of $\tau_p$ calculated with the same parameters as in (b); the smooth line is the same as in (c).

Moreover, the model describes the distribution of the empirical trading activity, i.e., the
number of transactions per selected time window, with the power-law exponent $\lambda = 4.4$.
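The two empirical quantities used here, the number of events per time window and the power spectral density of an event sequence, can be estimated directly from a list of event times. The following Python sketch is one way to do this; the function names are hypothetical, and the estimator $S(f) = (2/T)\,|\sum_k e^{-i 2\pi f t_k}|^2$ is a common periodogram convention assumed for the example, not necessarily the normalization used for Fig. 2.

```python
import cmath
import math


def window_counts(event_times, window, t_start=0.0, t_end=None):
    """Number of events in consecutive windows of equal length."""
    if t_end is None:
        t_end = max(event_times)
    n_win = int((t_end - t_start) // window)   # complete windows only
    counts = [0] * n_win
    for t in event_times:
        idx = int((t - t_start) // window)
        if 0 <= idx < n_win:
            counts[idx] += 1
    return counts


def point_process_psd(event_times, freqs):
    """Periodogram of a point process, S(f) = (2/T) |sum_k exp(-i 2 pi f t_k)|^2,
    with T the time between the first and the last event (assumed convention)."""
    t0 = min(event_times)
    duration = max(event_times) - t0
    psd = []
    for f in freqs:
        s = sum(cmath.exp(-2j * math.pi * f * (t - t0)) for t in event_times)
        psd.append(2.0 * abs(s) ** 2 / duration)
    return psd


# Toy data: four trades, counted in one-second windows.
print(window_counts([0.5, 1.5, 1.7, 3.2], window=1.0, t_end=4.0))

# Strictly periodic events give a sharp peak at the event frequency.
periodic = [float(k) for k in range(100)]
print(point_process_psd(periodic, [1.0, 0.5]))
```

The direct sum costs one pass over all events per frequency, so for long trade records a logarithmically spaced set of frequencies keeps the computation manageable.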